Annotation upon Annotation: Adding Signalling Information to a Corpus of Discourse Relations

نویسندگان

  • Maite Taboada
  • Debopam Das
چکیده

We present an annotation effort that involves adding a new layer of annotation to an existing corpus. We are interested in how rhetorical relations are signalled in discourse, and thus begin with a corpus already annotated for rhetorical relations, to which we add signalling information. We show that a very large number of relations carry signals that can help identify them as such. The detailed, extensive analysis of signals in the corpus can aid research in the automatic parsing of discourse relations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies

A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...

متن کامل

An Annotation System for Development of Chinese Discourse Corpus

Well-annotated discourse corpora facilitate the discourse researches. Unlike English, the Chinese discourse corpus is not widely available yet. In this paper, we present a webbased annotation system to develop a Chinese discourse corpus with much finer annotation. We first review our previous corpora from the practical point of view, then propose a flexible annotation framework, and finally dem...

متن کامل

Signalling Subject Matter and Presentational Coherence Relations in Discourse: a Corpus Study

In this study, we examine how subject matter and presentational coherence relations in Rhetorical Structure Theory (Mann and Thompson 1988) are signalled in written discourse, and whether they differ quantitatively or qualitatively in terms of the signalling devices involved. By signalling we mean textual signals (discourse markers such as although, because and thus, and also signals such as te...

متن کامل

Corpus Annotation of Macro Discourse Structures

We present our discourse annotation project, ANNODIS, which aims to make available a diversified French corpus annotated with discourse information, along with a set of tools for annotation and corpus exploitation. An original aspect of the project is that it combines two theoretically and methodologically different points of view on discourse: bottom-up and top-down. In the bottom-up perspecti...

متن کامل

Text as Scene: Discourse Deixis and Bridging Relations

This paper presents a new framework, “text as scene”, which lays the foundations for the annotation of two coreferential links: discourse deixis and bridging relations. The incorporation of what we call textual and contextual scenes provides more flexible annotation guidelines, broad type categories being clearly differentiated. Such a framework that is capable of dealing with discourse deixis ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • D&D

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2013